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This study evaluates the potential of near-infrared Raman spec- 
troscopy for in vivo detection of squamous dysplasia, a precursor 
to cervical cancer. A pilot clinical trial was carried out at three 
clinical sites. Raman spectra were measured from one colposcopi- 
cally normal and one abnormal area of the cervix. These sites were 
then biopsied and submitted for routine histologic analysis. Twenty- 
four e valuable measurements were made in vivo in 13 patients. Cer- 
vical tissue Raman spectra contain peaks in the vicinity of 1070, 
1180, 1195, 1210, 1245, 1330, 1400, 1454, 1505, 1555, 1656, and 1760 
cm" 1 . The ratio of intensities at 1454 to 1656 cm -1 is greater for 
squamous dysplasia than all other tissue types, while the ratio of 
intensities at 1330 to 1454 cm -1 is lower for samples with squamous 
dysplasia than all other tissue types. A simple algorithm based on 
these two intensity ratios separates high-grade squamous dysplasia 
from all others, misclassifying only one sample. Spectra measured 
in vivo resemble those measured in vitro. Cervical epithelial cells 
may contribute to tissue spectra at 1330 cm" 1 , a region associated 
with DNA. In contrast, epithelial cells probably do not contribute 
to tissue spectra at 1454 cm -1 , a region associated with collagen and 
phospholipids. 

Index Headings: Raman spectroscopy; Cervix; In vivo study. 



INTRODUCTION 

With the advent of molecular medicine, there will be 
an unprecedented opportunity for interventions based on 
early detection of disease. Optical technologies offer real- 
time assessment of biochemical and morphologic tissue 
composition, and provide an important new tool to im- 
prove early detection. Many groups have shown the 
promise of optical technologies for early detection of nu- 
merous pathologies, including atherosclerosis 1 * 2 and dys- 
plasia. 34 

The uterine cervix is a well-established clinical and 
histopathologic model of cancer progression. Our group 
has conducted a number of studies to evaluate emerging 
optical technologies for the in vivo detection of cervical 
dysplasia, including fluorescence spectroscopy, 5 confocal 
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imaging, 6 and optical coherence tomography (OCT). 7 Di- 
agnostic technologies in medicine are compared by using 
sensitivities, specificities, positive and negative predictive 
values, and receiver-operator characteristic curves. 8 
Emerging optical technologies show promise for cervical 
precancer detection as measured against traditional tech- 
nologies such as the Papanicolaou smear and colposcop- 
ically directed biopsy, with sensitivities and specificities 
of optical techniques approaching 80-90% in small and 
moderate sized clinical trials. 9 

The biological basis of these optical technologies is 
under intensive study. Reflectance spectroscopy samples 
electronic-energy-level differences in tissue chromo- 
phores, such as hemoglobin and proteins. Fluorescence 
spectroscopy also is sensitive to electronic-energy-level 
transitions; native tissue fluorophores include the aro- 
matic amino acids, crosslinks in collagen, and the cofac- 
tors NAD(P)H and FAD. Imaging techniques, such as 
confocal microscopy and OCT, are sensitive to small dif- 
ferences in tissue refractive index. Recently, Raman spec- 
troscopy has been explored as a new diagnostic tech- 
nique. 1011 Raman spectroscopy is sensitive to vibrational- 
energy-level differences. A large number of biological 
molecules can be probed by using Raman spectroscopy, 
with many having characteristic fingerprint Raman spec- 
tra. 

Several investigators have explored the potential of 
near-infrared Raman spectroscopy for detection of cancer 
and precancer in vitro. Feld and colleagues showed that 
in vitro Raman spectra of normal breast tissue were dom- 
inated by peaks characteristic of fatty acids at 1667, 1452, 
and 1300 cm -1 whereas Raman spectra of tumors were 
dominated by structural protein modes at 1667, 1452, 
1260, 890, and 820 cm" 1 . 12 McCreery and co-workers 
showed that in vitro Raman spectra could classify normal 
and cancerous breast tissue specimens based on the in- 
creased signatures associated with cholesterol and lipid 
relative to protein. 13 Our group measured Raman spectra 
of normal and dysplastic cervical tissue in vitro; simple 
but effective algorithms could be developed to discrimi- 
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nate dysplasia from normal cervix tissue based on these 
spectra. 14 As dysplasia developed, the ratio of Raman in- 
tensities at 1656 cm -1 to that at 1330 cm -1 decreased. 
The ratio of Raman intensities at 1454 cm" 1 to that at 
1656 cm" 1 was higher for high-grade dysplasias than for 
low-grade dysplasias. Chromophores that contribute to 
these Raman peaks include collagen (1454 cm -1 , 1656 
cm -1 ), phospholipids (1330 cm -1 , 1454 cm" 1 ) and DNA 
(1330 cm" 1 ). A challenge in these in vitro studies was 
that Raman spectra of biological samples are typically 
very weak; achieving high signal-to-noise ratios and 
avoiding background signals produced by the collection 
optics is difficult. 

Achieving a high signal-to-noise ratio and avoiding 
background signals is even more difficult in vivo, because 
the fiber-optic probes used to collect in vivo signals have 
high Raman signatures in the fingerprint region and in- 
tegration times and irradiation powers must be limited 
for practical and safety reasons. Recently, several groups 
have presented designs for fiber-optic probes that allow 
measurement of Raman spectra in vivo. 15 ' 19 In this study, 
we present the results of an in vivo pilot study using 
Raman spectroscopy to classify cervical precancer where 
histopathologic biopsy was used as the gold standard. We 
then compare these results to our previous in vitro work. 14 

Methods. Clinical Trial Design. A Pap smear is nor- 
mally obtained by a women's gynecologist or a family 
physician and analyzed at a cytology laboratory. If results 
are abnormal, further examination is necessary in a col- 
poscopy clinic where the cervix is examined with a low- 
power microscope and optical-contrast agents. If abnor- 
mal areas are identified they are biopsied and a treatment 
decision is formed when histological results are available. 

The clinical trial was carried out at three sites: The 
University of Texas M.D. Anderson Cancer Center, the 
Lyndon Baines Johnson Harris County Health District, 
and the Hermann Medical Center. Pilot studies at The 
University of Texas at M.D. Anderson, The University of 
Texas Health Science Center, and the Harris County 
Health District Internal Review Boards at the time of the 
clinical trial allowed for up to 25 patients to be entered 
into the trial. Each patient was informed of the study and 
was asked to sign an informed consent. Eligible patients 
included those referred with the diagnosis of an abnormal - 
Pap smear who were over 18 years old and not pregnant. 
All patients in the Colposcopy Clinic undergo a complete 
history and physical examination, sexually transmitted 
disease screening, a Papanicolaou smear, pan-colposcopy 
of the vagina, vulva and cervix, and colposcopically di- 
rected biopsies of diseased areas. 

After the cervical colposcopy, normal and abnormal 
areas were identified, and Raman spectra were measured 
from one normal and one abnormal area of the cervix. 
Each of these sites was then colposcopically biopsied. 
Biopsies were submitted for routine histologic analysis 
and read by a gynecologic pathologist. The pathologic 
categories were normal cervix, inflammation, squamous 
metaplasia, low-grade squamous dysplasia (HPV and 
CIN 1), high-grade squamous dysplasia (CIN 2, CIN 3, 
and CIS), and cancer. The pathologist was blinded to the 
results of the Raman spectroscopic study. 

Instrumentation. Figure 1 shows a block diagram of 
the system used to collect Raman spectra from cervical 
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Fig. 1. Block diagram of system used to measure Raman spectra in 
vivo. 



epithelium in vivo. The system has been described in de- 
tail previously. 16 Briefly, it consists of a diode laser at 
789 nm coupled to a fiber-optic delivery and collection 
probe. The probe directs the illumination light onto the 
cervix and the resulting Raman scattered light onto a ho- 
lographic spectrograph coupled to a liquid nitrogen 
cooled, back-illuminated, deep depletion, charge-coupled 
device (CCD) camera. The probe is optimized to measure 
epithelial tissue layers because the illumination and col- 
lection path cross with an angle of 75° at the tissue site, 
thus limiting the interrogated tissue volume to the front 
of the probe. The probe head (12 mm diameter) contains 
a bandpass filter to remove fluorescence from the exci- 
tation path and a holographic notch filter to suppress the 
reflected excitation light in the collection path. 

The laser output was 15-16.5 mW at the distal end of 
the fiber-optic probe. Prior to each patient measurement, 
the probe was disinfected in gluteraldehyde, rinsed in wa- 
ter, and wiped with alcohol. The spectrum of a neon cal- 
ibration lamp was measured for wavelength calibration 
purposes. A Raman spectrum was then measured from 
naphthalene and rhodamine calibration standards. A 
background spectrum was measured, with the probe cov- 
ered by a black plastic cap. Finally, the probe was ad- 
vanced through the speculum and placed in contact with 
a colposcopically normal and abnormal site on the cervix. 
Raman spectra were measured from these two sites; in- 
tegration times ranged from 60 to 180 s. The room light 
was turned off during measurements or where this was 
not possible a black cloth covered the pelvis and legs of 
the patient. Following spectral measurement, each site 
was biopsied. 

Data Processing and Analysis. Signal from the CCD 
camera was binned along one direction, to create a single 
spectrum per measurement site. From published neon 
lines, 20 we identified a subset of 12 lines that were not 
saturated and detectable in all of the calibration measure- 
ments. An automated routine located the neon peaks 
within a predefined window and fit them to a polynomial 
function of 2 nd degree. This created a relationship be- 
tween wavelength and pixel number. In a second step, 
peaks outside an offset of 0.3 nm from the expected lo- 
cation were removed and the same polynomial fit was 
repeated. This gave a calibration accuracy of approxi- 
mately 4 cm -1 while the spectral resolution was 8 cm -1 . 
For each measurement, wavenumbers were generated 
based on the center wavelength of the laser emission. 
Each spectrum was truncated below 1000 cm' 1 . Cosmic 
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TABLE I. Histopathologic diagnosis. 



Patient 


Colposcopically 


Colposcopically 


number 


normal area 


abnormal area 


9 


Squamous metaplasia 


High-grade dysplasia 


1 1 


Normal 


Inflammation 


12 


Norm al 


Not measured 


15 


Normal 


Signal-to-noise ratio too 






low to analyze 


16 


Normal 


Squamous metaplasia 


17 


Normal 


Inflammation 


18 


Normal 


Inflammation 


20 


Normal 


Inflammation 


21 


Normal 


Squamous metaplasia 


22 


Normal 


Low-grade dysplasia 


23 


Normal 


Squamous metaplasia 


24 


No normal areas available 


Two sites-high-grade dys- 






plasia 


25 


Normal 


High-grade dysplasia 



rays and defect pixels were removed from the spectra; 
missing data were replaced with linearly interpolated val- 
ues between the neighboring pixels. 

Each spectrum consisted of a small contribution of Ra- 
man scattering atop a large fluorescence background. The 
fluorescence background varies much more smoothly 
with wavelength, and thus was removed by using a high- 
pass Butterworth filter with a cut-off frequency equiva- 
lent to 2% of half the sampling frequency. 21 The remain- 
ing high-frequency content of the spectrum containing 
the Raman signal was smoothed with a Savitsky-Golay 
filter 22 by using a polynomial function of 1 st degree and 
a window width of 5 pixels, which corresponded to the 
system spectral resolution. 

Raman spectra were grouped according to histopatho- 
logic findings and average spectra were calculated. These 
average spectra were examined visually to identify a set 
of Raman peaks common to most spectra. Then, in the 
vicinity of these peaks an automated local maxima search 
routine was applied. Additionally an automated local 
minima search routine was applied in the same regions. 
The lowest two local minima on both sides of each local 
maximum were selected and averaged. The peak intensity 
was determined by the difference between the local max- 
imum and the average local minimum. 

For further analysis, peak intensities were normalized 
to the common four strongest Raman bands, yielding five 
datasets: unnormalized peak intensities and four sets of 
intensity ratios. To identify those peaks and peak ratios 
corresponding to the most significant differences between 
tissue types, a student's t-test was applied to test the hy- 
pothesis that the mean values of the peak intensities and 
ratios were different for all types of tissue. For the peaks 
and ratios with a p-value below 0.1, scatterplots showing 
pairwise combinations of intensity values and ratios for 
all samples were plotted. From the two-dimensional scat- 
ter plots, linear discriminators were formed relative to the 
gold standard of histopathology. The number of correctly 
classified samples was reported. Principal component 
analysis 14 - 23 and cross validation techniques were not ap- 
plied to the data because the sample size was not suffi- 
ciently large. 24 

RESULTS 

Twenty-five patients consented and entered the study 
at the combined three sites. The patients ranged in age 
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Fig. 2. Typical Raman spectra of a colposcopically normal area of the 
cervix with metaplasia and a colposcopically abnormal area with high- 
grade squamous dysplasia. 



from 18 to 76 years, and the average age was 32 years. 
Thrity-six percent were white, 36% were African-Amer- 
ican, and 28% were Hispanic. All the patients entered the 
Colposcopy Clinic with an abnormal Pap smear. Data 
from the first six patients in the study were excluded from 
further analysis, as this phase of the study was used to 
optimize hardware settings. Data from an additional three 
patients were excluded due to instrument malfunction. 
Data from one patient was excluded because pathology 
was unavailable because the histopathologic site location 
was not recorded. Data from two patients were excluded 
because interference from room lights resulted in a sig- 
nal-to-noise ratio of approximately 1. Thus, 24 evaluable 
measurements were made in 13 patients; the subset of 
patients excluded differed in no way from those included 
in further analysis. The results of the histologic biopsies 
are presented in Table I. 

Figure 2 shows typical processed spectra from patient 
9, a patient in whom a colposcopically normal area with 
histologic evidence of metaplasia (a normal finding in the 
cervix) is compared to a colposcopically abnormal area 
with histopathologic diagnosis of high-grade squamous 
dysplasia. The data-processing method removes the 
broadband autofluorescence, and in this subtraction pro- 
cess the baseline of the spectrum drops below zero. Be- 
cause of this, peak intensities are calculated by taking the 
intensity at the peak and subtracting the intensity at the 
surrounding minima. Raman peaks are present in the vi- 
cinity of 1070, 1180, 1210, 1245, 1270, 1330, 1400, 
1454, 1580, and 1656 cm" 1 . Note that the ratio of inten- 
sities at 1454 cm -1 to that at 1656 cm -1 is higher in the 
spectrum of the dysplastic sample (1 .43) compared to that 
in the spectrum of the metaplastic sample (1.26). The 
ratio of intensities at 1656 cm" 1 to that at 1330 cm" 1 is 
decreased in the dysplastic sample (1.23 vs. 1.27), which 
is consistent with earlier results measured in vitro.™ Also, 
the ratio of intensities at 1330 cm -1 to that at 1454 cm* 1 
is lower in the dysplastic sample (0.56) compared to that 
in the metaplastic sample (0.63). 

Figure 3 shows average spectra for each diagnostic cat- 
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Fig. 3. Average Raman spectra of each diagnostic category. The av- 
erage intensity ratios at 1454 to 1650 cm -1 are higher for squamous 
dysplasia (2.26) than for normal samples (1.85), samples with inflam- 
mation (1.78), and metaplastic samples (1.48). Similarly, the average 
intensity ratios at 1330 to 1454 cm" 1 are lower for squamous dysplasias 
(0.55) than for normal samples (0.60), samples with inflammation 
(0.72), and metaplastic samples (0.99). 

egory: normal, inflammation, metaplasia, and squamous 
dysplasia. As the diagnosis progresses from normal to 
inflammation to metaplasia and through squamous dys- 
plasia, the intensity of the Raman peak at 1454 cm -1 
increases. The intensity of the Raman peaks at 1330 and 
1656 cm -1 is greater for abnormal samples than normal 
samples. The ratio of intensities at 1454 to 1656 cm -1 is 
greater on average for squamous dysplasia samples than 
all other tissue types, while the ratio of intensities at 1330 
to 1454 cm" 1 is lower on average for samples with squa- 
mous dysplasia than all other tissue types. 

By examining all data we found that Raman peaks are 
present within 1070 ± 25, 1182 ± 10, 1195 ± 35, 1217 
± 7.5, 1250 ± 7.5, 1330 ± 25, 1400 ± 7.5, 1454 ± 17.5, 
1505 ± 7.5, 1555 ± 35, 1656 ± 30, and 1760 ± 20 cm" 1 . 
The peaks with the strongest signal were found at 1070, 
1330, 1454, and 1656 cm -1 . Table II shows the p-values 
resulting from the statistical comparison of average in- 
tensity values at those bands and the intensities normal- 
ized to the strongest peaks. Only the intercategory com- 
parisons with p-values less than or equal to 0.05 were 
included in this table. Note that for differentiating squa- 
mous dysplasia from normal tissue, inflammation and 
metaplasia, ratios of intensities at 1454 to 1656 cm -1 and 
1330 to 1454 cm -1 showed low p-values. 

Figure 4 shows diagnostic algorithms derived from 
these data that correspond to these ratios. This scatterplot 
indicates the intensity of each of the 24 sites measured 
by diagnostic category at three frequencies. Figure 4 
shows the intensity ratio at 1330/1454 cm -1 plotted 
against the ratio of intensities at 1454/1656 cm -1 . The 
linear discriminator, represented by the straight line, sep- 
arates high-grade squamous dysplasia from all others, 
misclassifying only one normal sample. 

DISCUSSION AND CONCLUSION 

This pilot study shows that it is possible to measure 
Raman spectra in vivo and extract potentially diagnosti- 
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TABLE II. Results of pairwise comparison of average intensities. 

Wavenumber 



Diagnostic category (cm -1 ) p-value 



riv«n!ncifl v« inflammation 


1 182/1070 


0.01 


riv^nlacia vc inflammation 

U y sUluiltt VI. Ill 11 alll Ulallvsil 


1400/1070 


0.01 


T^vcnlacta vc mptanla^ia 

\J y iUloola vs. lliciaplaola 


1656/1454 


0.01 


Dysplasia vs. normal 


1195/1070 


0.02 


Dysplasia vs. inflammation 


1195/1070 


0.02 


Dysplasia vs. normal 


1454/1656 


0.02 


Dysplasia vs. inflammation 


1454/1656 


0.03 


Dysplasia vs. metaplasia 


1182/1070 


0.04 


Dysplasia vs. normal 


1454/1330 


0.04 


Dysplasia vs. inflammation 


1400/1454 


0.04 


Dysplasia vs. normal 


1182/1454 


0.05 


Normal vs. inflammation 


1760/1070 


0.05 



cally useful information. Spectra measured in vivo resem- 
ble those measured in vitro. There are obvious visual dif- 
ferences in the spectra of normal cervix and high-grade 
squamous dysplasia in the same patient. Average spectra 
reveal a consistent increase in the Raman intensity at 
1330, 1454, and 1650 cm -1 as tissue progresses from 
normal to high-grade squamous dysplasia. These peaks 
are consistent with contributions from collagen, phospho- 
lipids, and DNA. However, because tissue is a complex, 
heterogeneous structure, definitive assignment is difficult. 
To assess whether epithelial cell signal contributes to tis- 
sue Raman spectra, we measured Raman spectra of a sus- 
pension of cervical cancer cells suspended in phosphate 
buffered saline at a concentration of 10 6 cells per mL with 
the use of a system described previously. 14 Results are 
shown in Fig. 5 and indicate that epithelial cells may 
contribute to tissue spectra at 1330 cm" 1 , a region asso- 
ciated with DNA. In contrast, epithelial cells probably do 
not contribute to tissue spectra at 1454 cm" 1 , a region 
associated with collagen and phospholipids. A dominat- 
ing peak for cells in suspension was found at 1656 cm -1 . 
Our results show that intensities at 1330, 1454, and 1656 
cm -1 can be used to discriminate high-grade squamous 
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Fig. 4. This scatterplot indicates the intensity of each of the 24 sites 
measured by diagnostic category at three frequencies; the intensity ratio 
at 1330/1454 cm -1 is plotted against the ratio of intensities at 1454/ 
1656 cm" 1 . The straight-line algorithm separates high-grade squamous 
dysplasia from all others, misclassifying one normal sample. 



2 1 : 1 1 1 1 1 r 




I 5 1 1 1 1 «- 1 ■ ! — 

1000 1100 1200 1300 1400 1500 1600 1700 

Wavenumber shift [1/cmJ 



Fig. 5. Raman spectra of a suspension of cervical cancer cells. 

dysplasia from all other samples with minimal classifi- 
cation errors. Similar algorithms were previously found 
to be useful in identifying high-grade squamous dysplasia 
in an in vitro study. 14 

The limitations of this study are that it is a pilot study 
with a small number of patients. Our experience shows 
that pilot studies are very useful in the development of 
emerging technologies; however, larger clinical trials are 
required to confirm these results. Improvements in hard- 
ware, measurement conditions, and training clinical staff 
to optimally participate in data collection are necessary 
to allow these larger trials. In particular, reducing the in- 
tegration time and enabling measurements to be made 
under normal room lighting conditions would facilitate 
larger trials. Because these problems can be solved with 
increased excitation power and further optimized probe 
configurations, 25 Raman spectroscopy offers an attractive 
tool for surveying the biochemical changes that accom- 
pany the development of dysplasia. 
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